Unbiased Learning to Rank Meets Reality: Lessons from Baidus Large-Scale Search Dataset
We tried to reproduce famous unbiased learning-to-rank results on the large Baidu-ULTR dataset. It turned out that it does not work as intended ... or at least that success depends on how your measure success.